Lexical-based Reordering Model for Hierarchical Phrase-based Machine Translation
نویسندگان
چکیده
Reordering is a critical step for statistical machine translation. The hierarchical phrasebased model (HPB) based on a synchronous context-free grammar (SCFG) is prominent in solving global reorderings. However, the model is inadequate to supervise the reordering process so that sometimes phrases or words are reordered due to the wrong choice of translation rule. In order to solve this problem, we propose a novel lexical-based reordering model to evaluate the correctness of word order for each translation rule. Our approach employs the word alignment and translation information during the decoding process without causing too much extra computational consumption. Experimental results on the Chinese-to-English task showed that our method outperformed the baseline system in BLEU score significantly. Moreover, the translation results further proved the effectiveness of our approach.
منابع مشابه
A Reordering Approach for Statistical Machine Translation
This paper presents a Markov based hierarchical reordering scheme for lexical reordering to incorporate into phrase-based statistical machine translation system. The goal is to reorder the words and phrases in source language syntactic structure into their corresponding target language syntactic order for making translation easy. Without reordering during language translation, sentences can onl...
متن کاملHierarchical Phrase-Based Translation with Jane 2
In this paper, we give a survey of several recent extensions to hierarchical phrase-based machine translation that have been implemented in version 2 of Jane, RWTH’s open source statistical machine translation toolkit. We focus on the following techniques: Insertion and deletionmodels, lexical scoring variants, reordering extensionswith non-lexicalized reordering rules and with a discriminative...
متن کاملA Lexicalized Reordering Model for Hierarchical Phrase-based Translation
Lexicalized reordering model plays a central role in phrase-based statistical machine translation systems. The reordering model specifies the orientation for each phrase and calculates its probability conditioned on the phrase. In this paper, we describe the necessity and the challenge of introducing such a reordering model for hierarchical phrase-based translation. To deal with the challenge, ...
متن کاملHierarchical Phrase-based Machine Translation with Word-based Reordering Model
Hierarchical phrase-based machine translation can capture global reordering with synchronous context-free grammar, but has little ability to evaluate the correctness of word orderings during decoding. We propose a method to integrate word-based reordering model into hierarchical phrasebased machine translation to overcome this weakness. Our approach extends the synchronous context-free grammar ...
متن کاملA Generalized Reordering Model for Phrase-Based Statistical Machine Translation
Phrase-based translation models are widely studied in statistical machine translation (SMT). However, the existing phrase-based translation models either can not deal with non-contiguous phrases or reorder phrases only by the rules without an effective reordering model. In this paper, we propose a generalized reordering model (GREM) for phrase-based statistical machine translation, which is not...
متن کاملLearning Probabilistic Synchronous CFGs for Phrase-Based Translation
Probabilistic phrase-based synchronous grammars are now considered promising devices for statistical machine translation because they can express reordering phenomena between pairs of languages. Learning these hierarchical, probabilistic devices from parallel corpora constitutes a major challenge, because of multiple latent model variables as well as the risk of data overfitting. This paper pre...
متن کامل